MODERN APPROACHES TO MULTICLASS INTENT CLASSIFICATION BASED ON PRE-TRAINED TRANSFORMERS
Annotation
Subject of Research. The paper considers modern approaches to the multiclass intention classification problem. The user intention is the incoming user requests when interacting with voice assistants and chatbots. The algorithm is meant for determination what class the call belongs to. Modern technologies such as transfer learning and transformers can improve significantly the multiclass classification results. Method. This study uses a comparative model analysis technique. In turn, each model is inlined into a common pipeline for data preparing and clearing, and the model training but with regard to its specific requirements. The following models applied in real projects have been selected for comparison: Logistic Regression + TF-IDF, Logistic Regression + FastText, LSTM + FastText, Conv1D + FastText, BERT, and XLM. The sequence of models corresponds to their historical origin, but in practice these models are used without regard to the time period of their creation but depending on the effectiveness of the problem being solved. Main Results. The effectiveness of the multiclass classification models on real data is studied. Comparison results of modern practical approaches are described. In particular, XLM confirms the superiority of transformers over other approaches. An assumption is made considering the reason why the transformers show such a gap. The advantages and disadvantages of modern approaches are described. Practical Relevance. From a practical point of view, the results of this study can be used for projects that require automatic classification of intentions, as part of a complex system (voice assistant, chatbot or other system), as well as an independent system. The pipeline designed during the study can be applied for comparison and selection of the most effective model for specific data sets, both in scientific research and production.
Keywords
Постоянный URL
Articles in current issue
- READER’S NOTE
- BLOCKCHAIN TECHNOLOGY IN 5G NETWORKS
- LASER HEATING NUMERICAL SIMULATION OF TITANIUM-CONTAINING OPTOTHERMAL FIBER CONVERTER AND VEIN WALL DURING ENDOVASALLASER COAGULATION
- EXPERIMENTAL STUDY OF OPTICAL POWER EFFECT ON THE STRUCTURE OF FIBER-OPTICAL DIFFUSER OBTAINED BY FIBER CORE MELTING
- METHODS OF HOT WIRE CREATION FOR FIBER-OPTICAL THERMAL ANEMOMETER
- EFFECT OF EXTERNAL RELATIVE PRESSURE ON PHASE SHIFT IN SAGNAC INTERFEROMETER
- ABSORPTION CHARACTERISTICS OF SILVER ION-EXCHANGED LAYERS IN CHLORIDE PHOTO-THERMO-REFRACTIVE GLASS
- ERBIUM SPECTRAL-LUMINESCENT CHARACTERISTICS IN BROMIDE-FLUORIDE PHOTO-THERMO-REFRACTIVE GLASSES
- AUTOMATED HANDDETECTION METHOD FOR TASKS OF GESTURE RECOGNITION IN HUMAN-MACHINE INTERFACES
- EFFECTIVE IMPLEMENTATION OF MODERN MCELIECE CRYPTOSYSTEM ON GENERALIZED (L,G)-CODES
- NORMALIZATION OF KAZAKH LANGUAGE WORDS
- IMAGE-BASED DEFECT ANALYSIS FOR 3D-PRINTED ITEM SURFACE USING MACHINE LEARNING METHODS
- MODEL OF AUTOMATED SYNTHESIS TOOL FOR HARDWARE ACCELERATORS OF CONVOLUTIONAL NEURAL NETWORKS FORPROGRAMMABLE LOGIC DEVICES
- PACKET RESERVATIONS IN REAL-TIME MULTIPATH TRANSMISSIONS
- MODULAR APPROACH APPLICATION IN DEVELOPMENT OF COMPUTER NUMERICAL CONTROL SOFTWARE
- PARAMETRIC IDENTIFICATION OF DIFFERENTIAL-DIFFERENCE MODELS OF HEAT TRANSFER IN ONE-DIMENSIONAL BODIES BASED ON KALMAN FILTER ALGORITHMS
- SELECTION OF COMPOSITE MATERIAL IN ELECTROMAGNETIC LOG SENSOR
- MODELING OF LIQUEFIED NATURAL GAS EVAPORATION IN MOBILE RESERVOIRS
- MATHEMATICAL MODEL OF LIQUEFIED NATURAL GAS EVAPORATION AND ANALYSIS OF ORIGINAL COMPOSITION EFFECT ON EVAPORATION SPEED
- MESHLESS MODELING OF ELASTIC DEFORMATIONS OF POLYMERIC COMPOSITE MATERIALS UNDER STATIC LOADING
ACCURACY INCREASE OF SOFTWARE AND HARDWARE APPLIANCE FOR MUSCLE ACTIVITY MEASURING AND MONITORING BY FILTRATION OF CARRIER COMPONENT AND FREQUENCIES HIGHER THAN MEASURED SIGNAL RANGE